Automatic Creation of Domain Templates
نویسندگان
چکیده
Recently, many Natural Language Processing (NLP) applications have improved the quality of their output by using various machine learning techniques to mine Information Extraction (IE) patterns for capturing information from the input text. Currently, to mine IE patterns one should know in advance the type of the information that should be captured by these patterns. In this work we propose a novel methodology for corpus analysis based on cross-examination of several document collections representing different instances of the same domain. We show that this methodology can be used for automatic domain template creation. As the problem of automatic domain template creation is rather new, there is no well-defined procedure for the evaluation of the domain template quality. Thus, we propose a methodology for identifying what information should be present in the template. Using this information we evaluate the automatically created domain templates through the text snippets retrieved according to the created templates.
منابع مشابه
Automatic Creation of Domain Templates
Recently, many Natural Language Processing applications have improved the quality of their output by using various machine learning techniques to mine Information Extraction patterns for capturing information from the input text. Currently, to mine IE patterns one should know in advance the type of the information which should be captured by these patterns. In this work we propose a novel metho...
متن کاملProviding a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance
Introduction: Psychological problems are the result of a person's interaction with the environment and include behaviors that cause social conflicts, dissatisfaction and individual unhappiness. The present study aimed to provide a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance....
متن کاملAutomatic Creation of CV Templates for Formant Type Speech Synthesis Based on HMM-Based Segmentation and Syllable Boundary Detection
An automatic method to create CV forrnantsource templates from continuous speech corpus is proposedfor speechsynthesis,where the boundaries of the CV templates are decided on the basis of the Mahalanobis distance. The synthetic experiments have proved the methodto be useful.
متن کاملAutomatic Extraction of Briefing Templates
An approach to solving the problem of automatic briefing generation from non-textual events can be segmenting the task into two major steps, namely, extraction of briefing templates and learning aggregators that collate information from events and automatically fill up the templates. In this paper, we describe two novel unsupervised approaches for extracting briefing templates from human writte...
متن کاملAutomatic Modulation Recognition using the Discrete Wavelet Transform
An Automatic Modulation Recognition (AMR) process using the Discrete Wavelet Transform (DWT) is presented in this work. The AMR algorithm involves the use of wavelet domain signal templates derived from digitally modulated signals that are used to transmit binary data. The signal templates, locally stored in a receiver, are cross-correlated with the incoming noisy, received signal after it has ...
متن کامل